|
|
Accession Number |
TCMCG075C00147 |
gbkey |
CDS |
Protein Id |
XP_007046584.2 |
Location |
complement(join(514970..515108,515180..515233,515423..515494,515649..518755,518835..519093,519450..519736,519812..520147,520231..520372,520448..520668,520773..520934,521032..521190,521277..521576,522323..522586,522849..522932)) |
Gene |
LOC18610701 |
GeneID |
18610701 |
Organism |
Theobroma cacao |
|
|
Length |
1861aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_007046522.2
|
Definition |
PREDICTED: DNA-directed RNA polymerase II subunit 1 [Theobroma cacao] |
CDS: ATGGATTTGCGATTCCCTTATTCTCCCGCGGAGGTCGCCAAGGTCCGCATGGTCCAGTTCGGCATCCTCAGCCCCGACGAAATCCGGCAAATGTCGGTGGTGCAGATTGAGCACGGTGAAACGACCGAGAGGGGAAAACCTAAAGTGGGCGGTTTAAGTGACCCACGGCTGGGCACAATTGATAGGAAGATGAAGTGTGAGACGTGTACGGCCAACATGGCAGAGTGTCCAGGGCATTTTGGGCACTTGGAGCTTGCCAAGCCTATGTTTCATATTGGGTTTATGAAAACTGTGCTCAGTATAATGAGATGTGTCTGCTTCAATTGCTCAAAAATTCTCGCTGATGAGGAAGAGCATAAATTTAAGCAGGCATTGAAAATAAAGAATCCAAAGAATAGGCTGAAAAAGATCTTAGATGCATGCAAGAATAAATCTAAATGTGAGGGCGGTGATGAAATTGATGTCCAAGGTCAAGATACAGAGGAGCCTGTAAAAAAGAGTCGTGGTGGCTGTGGTGCTCAGCAGCCAAAGCTGAGTATTGATGGTATGAAGATGATTGCAGAATATAAGCCCCAGAGGAAGAGGAATGATGATCAGGAACAACTTCCTGAACCTGTGGAAAGAAAACAGACTCTCACTGCTGAACGGGTTCTTAGTGTCTTGAAGAGGATAAGCGACGAGGACTGCCAACTATTAGGCTTAAATCCTAAGTTTGCCCGTCCAGATTGGATGATTCTTCAAGTACTTCCAATACCTCCACCTCCTGTTAGACCTTCTGTGATGATGGACACTTCGTCTAGGAGTGAGGATGACTTAACACATGCGTTGGCTATGATCATTCGCCACAATGAGAACTTGAGGAGACAGGAGAGAAATGGGTCACCTGCACATATCATATCTGAATTTGCGCAGTTGTTGCAGTTCCATGTAGCAACATATTTTGACAATGAGTTGCCTGGACTGCCAAGGGCTACGCAGAGATCTGGGAGACCTATTAAATCTATTTGTAGTAGGCTTAAGGCGAAAGAGGGCCGGATTAGGGGTAACTTGATGGGAAAAAGGGTTGACTTTTCAGCTCGAACGGTTATTACACCAGATCCAAATATCAATATTGATGAACTTGGAGTACCCTGGAGTATTGCTTTAAACCTCACGTATCCAGAAACTGTGACTCCATATAACATTGAGAGGTTGAAAGAGCTTGTGGAATATGGGCCTCATCCTCCACCTGGAAAGACTGGTGCTAAGTATATCATAAGAGATGATGGACAAAGGCTTGATCTTCGTTACCTGAAGAAAAGTAGTGATCATCACTTAGAGCTTGGTTATAAGGTGGAGCGGCATTTAAATGATGGAGATTTTGTCCTTTTTAACCGTCAACCTAGTCTCCATAAAATGTCAATCATGGGCCACAGAATCAGGATTATGCCGTACTCTACATTTCGTCTAAATTTGTCTGTTACTTCGCCATACAATGCTGATTTTGATGGTGATGAAATGAATATGCATGTTCCTCAATCATTTGAGACAAGAGCAGAGGTATTGGAGCTCATGATGGTGCCTAAGTGCATTGTGTCCCCTCAATCAAATAGGCCTGTTATGGGGATTGTCCAGGATACACTTCTAGGATGCCGTAAAATCACCAAAAGGGATACTTTCATAGAAAAGGATGTATTCATGAATATATTAATGTGGTGGGAGGACTTTGATGGAAAGGTTCCAGCTCCTGCAATTCTAAAGCCACGACCACTTTGGACTGGGAAGCAGGTTTTTAATCTTATCATTCCAAAGCAGATAAATCTGCTGAGAAATTCTGCATGGCATTCAGAAACTGAGACTGGGTTTATCACCCCTGGGGATACTCAAGTTCGAATAGAAAAAGGAGAGCTACTGTCTGGCACTCTTTGCAAGAAGGCACTTGGAACATCTTCTGGAAGTCTTATACATGTCATTTGGGAAGAGGTTGGTCCTGATGCAGCTCGGAAATTTTTGGGACATACACAATGGCTTGTTAATTATTGGCTGCTGCAGAATGCCTTTAGTATTGGTATTGGAGATACCATTGCTGATGCAGCAACAATGGAAAAAATTAATGAAACTATTTCAAAAGCCAAAGAGGAAGTGAAAAACCTCATTGTGAAAGCCCAAAACAAAGATTTAGAGCCTGAACCTGGAAGAACTATGATGGAATCTTTTGAAAACAAAGTTAACCAGGTGTTGAATAAAGCTCGTGACGATGCAGGAAACAGTGCACAAAAAAGTTTGTCAGAAAGCAACAACCTTAAGGCAATGGTTACTGCAGGATCCAAGGGAAGTTTCATTAACATATCCCAAATGACAGCTTGTGTGGGTCAGCAGAATGTTGAGGGAAAGCGAATCCCATTTGGGTTCATAGATCGTACATTGCCCCATTTTACCAAGGATGATTATGGCCCTGAAAGTCGTGGGTTTGTGGAGAACTCGTACTTGCGTGGGTTGACCCCACAAGAGTTCTTTTTCCATGCCATGGGTGGTAGAGAAGGTCTCATAGATACTGCGGTTAAGACTTCTGAGACTGGGTACATACAGAGGCGACTTGTGAAGGCAATGGAGGATATTATGGTTAAATATGATGGAACTGTGAGGAACTCATTGGGGGATGTTATTCAATTCCTCTATGGTGAAGATGGTATGGATTCTGTATGGATAGAATCTCAGAAGCTGGATTCTTTGAAGATGAAGAAATCAGAATTTGATAGGGTCTTCAGATATAATATAGATGATGAAAGCTGGAATCCAACTTCTTATATGTTACCAGAGCACATTGAAGATTTGAGAACTATCCAAGAATTGCGTGATGTATTCGAAGCTGAAGTTCAAAAACTTGAAGCTGATAGATACCAACTTGGAACAGAGATTGCAGTCACGGGTGATAGTAATTGGCCTTTGCCTGTTAACTTGAAGAGGCTTATCTGGAATGCACAGAAGACTTTTAAAGTTGATTTTAGAAGGGTGTCTGATTTGCACCCAGTGGAAATTGTAGATTCTGTTGATAAGCTCCAGGAAAGGTTAAAGGTTGTTCCGGGTACAGATCCTCTGAGTGTGGAAGCCCAAAAGAATGCCACCCTCTTCTTCAGCATTTTGCTTCGCAGTACTTTGGCCAGTAAAAGGGTCTTGCAGGAATATAGACTTACAAAGGAAGCATTTGAGTGGGTTATTGGTGAAATAGAGTCACGATTCTTGCAGTCTTTAGTAGCACCTGGTGAGATGATAGGTTGTGTTGCTGCACAATCAATTGGTGAGCCTGCTACTCAAATGACTCTTAACACCTTCCACTATGCTGGTGTGAGTGCAAAGAATGTTACCCTTGGTGTTCCCAGGTTGAGGGAAATTATTAATGTAGCTAAGAAAATCAAAACACCTTCTCTTTCAGTCTACCTCTCTCCTGAAGCTAGTAAGACAAAGGAGAAGGCCAAGAATGTCCAATGTGCTTTAGAATACACTACTCTTCGGAGTGTTACTCATGCTACTGAAGTATGGTATGATCCAGACCCCACGAGCACAATTATTGAGGAGGATATTGACTTTGTTAAGTCCTACTATGAAATGCCAGATGAAGAGGTTGCCCCTGAAAAAATCTCCCCTTGGCTTCTGCGCATAGAGTTGAATCGCGAGATGATGGTTGATAAGAAGTTGAGTATGGCTGACATAGCTGAGAAGATCAATCTGGAATTTGATGACGATTTGACTTGCATATTTAACGATGACAATGCAGAGAAGCTGATCCTACGCATCCGTATCATGAATGATGAAGGCCCGAAGGGTGAACTGAATGATGAGTCTGCTGAAGATGATGTATTCTTGAAGAAGATTGAAAGCAACATGCTGACGGAAATGGCACTTCGAGGTATCCCAGACATCAACAAGGTGTTCATTAAACATAGTAAAGCAAGCAAGTTTGATGAGGCTGATGGATATAAGACTGGAGAGGAGTGGGTGTTGGATACAGAAGGTGTCAATCTCTTGGCTGTCATGTGCCACGAAGATGTTGATGCAAGGAGAACAACAAGCAATCACTTGATTGAAGTGATTGAAGTTCTTGGAATTGAAGCAGTCCGCCGGTCCTTGTTGGATGAATTGCGAGTTGTGATATCATTTGATGGGTCTTATGTGAATTATCGTCATCTGGCTATACTCTGTGATACAATGACCTATCGTGGCCACTTGATGGCTATCACCCGTCATGGCATCAACCGCAATGATACAGGGCCTATGATGAGGTGCTCATTTGAGGAAACAGTAGATATTCTCCTTGATGCTGCTGTCTATGCTGAGTCGGATTATTTGAGGGGTGTGACGGAGAATATAATGTTGGGTCAGCTTGCACCAATTGGAACAGGAGACTGTGCCTTGTATCTTAATGATGAGATGTTGAAGAATGCCATTGAACTTCAGCTACCTAGTTACATGGAAGGTCTGGAATTTGGCATGACGCCTGCTCGTTCTCCAGTGTCAGGAACTCCTTATCATGAAGGCATGATGTCTCCAAGTTATTTGCTGAGTCCAAATCTCCGTCTCTCACCCGTTACGGATGCTCAGTTCTCACCTTATGTAGGTGGGATGGCATTCTCTCCTACGTCGTCCCCTGGTTACAGCCCATCATCTCCAGGATACAGTCCATCCTCCCCTGGCTACAGTCCTACCTCTCCTGGTTATAGTCCTACATCTCCTGGGTATAGTCCCACCTCTCCTGGGTACAGCCCCACATCACCTACTTACAGTCCTAGTTCGCCAGGTTACAGTCCAACAAGTCCAGCATATTCTCCTACAAGTCCCTCTTATTCTCCGACCTCACCGAGCTACAGCCCTACCTCTCCAAGTTACAGCCCTACATCTCCAAGTTACAGTCCCACTTCACCAAGTTATAGTCCCACATCTCCCAGCTATAGTCCAACTTCACCTAGCTACAGCCCCACTTCACCCGTATATAGCCCAACTTCACCAGCATACAGCCCCACTTCACCCGCATATAGCCCCACTTCTCCATCATACAGCCCAACATCCCCATCATACAGCCCTACATCTCCTTCTTACAGCCCAACCTCACCCTCCTATAGCCCCACATCTCCATCATACAGCCCTACTTCACCAGCTTATAGCCCCACTTCTCCAGGGTACAGTCCAACCTCACCAAGTTATAGTCCCACGTCACCAAGCTATAGTCCTACCTCTCCAAGTTACAATCCACAGTCAGCCAAGTACAGTCCATCTCTTGCGTACTCACCAAGCAGTCCAAGGTTGTCTCCGTCAAGTCCTTATAGTCCAACTTCACCAAATTACAGCCCAACATCACCATCATATTCACCGACTTCTCCATCTTACTCCCCTTCAAGTCCAACATACAGTCCTAGCAGCCCTTATAATTCTGGAGTGAGTCCAGACTACAGCCCAAGTTCGCCACAGTACAGTCCAAGTGCTGGGTACTCCCCTAGTGCTCCAGGGTATTCACCATCATCAACTAGCCAGTACACCCCACAAACAAGCAACAAGGATGATCGGGCGACGAAGGATGATAGAAGCAGTAAAGATGACAGGAGTAAACGATAA |
Protein: MDLRFPYSPAEVAKVRMVQFGILSPDEIRQMSVVQIEHGETTERGKPKVGGLSDPRLGTIDRKMKCETCTANMAECPGHFGHLELAKPMFHIGFMKTVLSIMRCVCFNCSKILADEEEHKFKQALKIKNPKNRLKKILDACKNKSKCEGGDEIDVQGQDTEEPVKKSRGGCGAQQPKLSIDGMKMIAEYKPQRKRNDDQEQLPEPVERKQTLTAERVLSVLKRISDEDCQLLGLNPKFARPDWMILQVLPIPPPPVRPSVMMDTSSRSEDDLTHALAMIIRHNENLRRQERNGSPAHIISEFAQLLQFHVATYFDNELPGLPRATQRSGRPIKSICSRLKAKEGRIRGNLMGKRVDFSARTVITPDPNINIDELGVPWSIALNLTYPETVTPYNIERLKELVEYGPHPPPGKTGAKYIIRDDGQRLDLRYLKKSSDHHLELGYKVERHLNDGDFVLFNRQPSLHKMSIMGHRIRIMPYSTFRLNLSVTSPYNADFDGDEMNMHVPQSFETRAEVLELMMVPKCIVSPQSNRPVMGIVQDTLLGCRKITKRDTFIEKDVFMNILMWWEDFDGKVPAPAILKPRPLWTGKQVFNLIIPKQINLLRNSAWHSETETGFITPGDTQVRIEKGELLSGTLCKKALGTSSGSLIHVIWEEVGPDAARKFLGHTQWLVNYWLLQNAFSIGIGDTIADAATMEKINETISKAKEEVKNLIVKAQNKDLEPEPGRTMMESFENKVNQVLNKARDDAGNSAQKSLSESNNLKAMVTAGSKGSFINISQMTACVGQQNVEGKRIPFGFIDRTLPHFTKDDYGPESRGFVENSYLRGLTPQEFFFHAMGGREGLIDTAVKTSETGYIQRRLVKAMEDIMVKYDGTVRNSLGDVIQFLYGEDGMDSVWIESQKLDSLKMKKSEFDRVFRYNIDDESWNPTSYMLPEHIEDLRTIQELRDVFEAEVQKLEADRYQLGTEIAVTGDSNWPLPVNLKRLIWNAQKTFKVDFRRVSDLHPVEIVDSVDKLQERLKVVPGTDPLSVEAQKNATLFFSILLRSTLASKRVLQEYRLTKEAFEWVIGEIESRFLQSLVAPGEMIGCVAAQSIGEPATQMTLNTFHYAGVSAKNVTLGVPRLREIINVAKKIKTPSLSVYLSPEASKTKEKAKNVQCALEYTTLRSVTHATEVWYDPDPTSTIIEEDIDFVKSYYEMPDEEVAPEKISPWLLRIELNREMMVDKKLSMADIAEKINLEFDDDLTCIFNDDNAEKLILRIRIMNDEGPKGELNDESAEDDVFLKKIESNMLTEMALRGIPDINKVFIKHSKASKFDEADGYKTGEEWVLDTEGVNLLAVMCHEDVDARRTTSNHLIEVIEVLGIEAVRRSLLDELRVVISFDGSYVNYRHLAILCDTMTYRGHLMAITRHGINRNDTGPMMRCSFEETVDILLDAAVYAESDYLRGVTENIMLGQLAPIGTGDCALYLNDEMLKNAIELQLPSYMEGLEFGMTPARSPVSGTPYHEGMMSPSYLLSPNLRLSPVTDAQFSPYVGGMAFSPTSSPGYSPSSPGYSPSSPGYSPTSPGYSPTSPGYSPTSPGYSPTSPTYSPSSPGYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPVYSPTSPAYSPTSPAYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPAYSPTSPGYSPTSPSYSPTSPSYSPTSPSYNPQSAKYSPSLAYSPSSPRLSPSSPYSPTSPNYSPTSPSYSPTSPSYSPSSPTYSPSSPYNSGVSPDYSPSSPQYSPSAGYSPSAPGYSPSSTSQYTPQTSNKDDRATKDDRSSKDDRSKR |